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Abstract 

Our aim in this paper is to point out a surprising formal connection, between two topics 
which seem on face value to have nothing to do with each other: relational database theory, 
{3JQ[ and the study of non-locality and contextuality in the foundations of quantum mechanics. We 

^ , shall show that there is a remarkably direct correspondence between central results such as 

Bell's theorem in the foundations of quantum mechanics, and questions which arise naturally 
and have been well-studied in relational database theory. 

1 Introduction 

Our aim in this paper is to point out a surprising formal connection, between two topics which 
seem on face vakie to have nothing to do with each other; 

• Relational database theory. 

• The study of non-locality and contextuality in the foundations of quantum mechanics. 

We shall show, using the unified treatment of the latter developed in 1 , that there is a 
remarkably direct correspondence between central results such as Bell's theorem in the foundations 
of quantum mechanics, and questions which arise naturally and have been well-studied in relational 
database theory. 

The present paper is meant to be an introduction to these two topics, emphasizing their 
common content, presented in a manner which hopefully will be accessible to readers without 
prior knowledge of either. 

How should this unexpected connection be interpreted? One idea is that the notion of contex- 
tuality is rather fundamental, and we can see some outlines of a common 'logic of contextuality' 
arising from this appearance of common structure in very different settings. 

Ideally, some deeper connections can also be found, leading to interesting transfers of results 
and methods. The present paper may at least serve to point the way in this direction. 



2 Relational Databases 
2.1 Review of basic notions 

We shall begin by reviewing some basic notions of relational database theory. 

We start with an example to show the concrete scenario which is to be formalized. 

Example 

Consider the following data table: 
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branch- name 


account-no 


customer- name 


balance 


Cambridge 


10991-06284 


Newton 


£2,567.53 


Hanover 


10992-35671 


Leibniz 


€11,245.75 











Let us anatomize this table. There are a set of attributes, 

{branch-name, account-no, customer-name, balance} 

which name the columns of the table. The entries in the table are 'tuples' which specify a value 
for each of the attributes. The table is a set of such tuples. A database will in general have a 
set of such tables, each with a given set of attributes. The schema of the database — a static, 
syntactic specification of the kind of information which can reside in the database — is given by 
specifying the set of attributes for each of the tables. The state of the database at a given time 
will be given by a set of tuples of the appropriate type for each of the tables in the schema. 
We now proceed to formalize these notions. 

We fix some set A which will serve as a universe of attributes. A database schema S over A 
is a family S = {^i, . . . , Ak} of finite subsets of A. 

At this — surprisingly early! — point, we come to an interesting juncture. The standard 
approach in much of the current relational database literature is to formalize the notion of tuple 
as an ordered n-tuple in 13" for some set D of data values; a relation is then a subset of D". This 
is motivated by the desire to make the connection to the standard notion of relational structure 
in first-order logic as direct as possible. This choice creates a certain distance between the formal 
notion of relation, and the informal notion of table; in practice this is not a problem. 

For our purposes, however, we wish to make a different choice: we shall formalize the notion of 
tuple, and hence of relation, in a fashion which directly refiects the informal notion. As we shall 
see, this will have both mathematical and conceptual advantages for our purposes. At the same 
time, there will be no real problem in relating this formalism to the standard one in the literature. 
Note that the style of formalization we shall use is also commonly found in the older literature on 
relational databases, see e.g. pS] . 

We shall assume that for each a G A there is a set Da of possible data values for that at- 
tribute. Thus for example the possible values for customer-name should be character strings, 
perhaps with some lexical constraints; while for balance the values should be pairs (currency, 
amount), where currency comes from some fixed list (£, €, ...), and amount is a number. 
These correspond to domain integrity constraints in the usual database terminology. 

Given A e E, we define the set of A-tuples to be IlaeA^a- Thus an A-tuple is a function 
which assigns a data value in Da to each a € A. 

In our example above, the first tuple in the table corresponds to the function 

{branch-name i— > Cambridge, account-no i-> 10991—06284, customer-name H> Newton, 

balance £2, 567.53} 

A relation of type A is a set of A-tuples. Given a schema S, an instance of the schema, 
representing a possible state of the database, is given by specifying a relation of type A for each 

Operations on relations 

We consider some of the fundamental operations on relations, which play a central role in relational 
databases. Firstly, relations of type A live in the powerset T^dlaGA ^a), which is a boolean algebra; 
so boolean operations such as union, intersection, and set difference can be applied to them. 

At this point, we note that our assumption that each attribute a G A has a given set of possible 
data values is not entirely innocent. It means that boolean operations such as set complement 
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of relations have an 'absolute', i.e. instance- independent meaning. By contrast, in the standard 
view there is no fixed universe of values for instances. An instance specifies its own universe U as 
well as the intepretations of the database relations as sets of ordered tuples Ri C C/"* over this 
universe. 

While this difference in formalization could have some interesting consequences, we shall not 
pursue this here. 

Next, we consider the operation of projection. In the language of A-tuples, projection is 
function restriction. That is, given an A-relation R, and a subset B (- A, we define: 

R\B {t\B : teR). 

Here, since t e Haeyi -^o' J^^^ means restriction of the function t to S, which is a subset of 
its domain. This operation is then lifted pointwise to relations. 

Now we consider the independent combination of relations, which is cartesian product in the 
standard formalism. The representation of tuples as functions leads to a 'logarithmic shift' in the 
representatior{3, whereby this operation is represented by disjoint union of attribute sets. 

Given an A-relation R and a i3-relation S, we form the disjoint union AU B, and the AU B- 
relation 

R(»S -.^ {t€ W Da : t\A€ R h t\B C S}. 

aeAUB 

Of course, as concrete sets A and B may overlap. We can force them to be disjoint by 'tagging' 
them appropriately, e.g. 

AUB {0} X A U {1} X B. 

The minor housekeeping details of such tagging can safely be ignored^ We shall henceforth do so 
without further comment. 

This is only a subset of the operations available in standard relational algebra [25] . A more 
complete discussion could be given in the present setting, but this will sufhce for our purposes. 

2.2 The functorial view 

We shall now show how the relational database formalism, in the style we have developed it, has 
a direct expression in functorial terms. This immediately brings a great deal of mathematical 
structure into play, and will allow us to relate some important database notions to concepts of 
much more general standing. 

We shall assume the rudiments of the language of categories, functors and natural transforma- 
tions. All the background we shall need is covered in the charming (and succinct) text pi] . 

We shall consider the partial order Att of finite subsets of A, ordered by inclusion, as a 
category. 

We shall define a functor T : Att°'' >- Set where T{A) is the set of A-tuples. Formally, we 

define 

T{A) := n^a, 

aeA 

and \i A(- B, we define the restriction map : T{B) T{A) by 

PA-t^-^ t\A. 

It is easy to verify functoriality of T, which means that, whenever A C B C C, 

Pa°P%^ Pa. 

and also that ~ id^. Thus 7~ is a presheaf, and restriction is exactly function restriction. 

^Think of 2"2'' = 2"+'', and hence log(xy) = \og{x) + log(y). 

^The relevant result is the coherence theorem for monoidal categories |19| . 
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We also have the covariant powerset functor V : Set >- Set, which acts on functions by 

direct image: if / : X Y , then 

Vf : VX VY -.-.S^ {f{x) : xeS}. 

We can compose V with T to obtain another presheaf 

Tl—VoT: Att°P Set. 

This presheaf assigns the set of ^-relations to each set of attributes A; while the restriction map 

is exactly the operation of relation restriction, equivalent to the standard notion of projection in 
relation algebra, which we defined previously: 

Pa-R^ R\A. 

Natural Join 

One of the most important operations in relational algebra is natural join. Given an A-relation R 
and a B-relation S, we define an {A U i?)-relation Rxi S: 

Rt<S := {t& ]J Da : t\A&R A t\B &S}. 

We shall now show how this operation can be characterized in categorical terms. 

Note firstly that since the powerset is naturally ordered by set inclusion, we can consider TZ as 
a functor 

n : Att°P Pos 

where Pos is the category of posets and monotone maps. Pos is order-enriched; given monotone 
maps /, gf : P — )• Q, we can define the pointwise order: 

f<g = \/x&Rf{x)<g{x). 

Now suppose we are given attribute sets A and B. We consider the following diagram arising from 
the universal property of product in Set. 

n{A) ^ n{A) X 7^(s) ^ n{B) 




n{A U B) 

Proposition 2.1 The natural join cxi: '1Z{A) x '1Z{B) TZ{A U B) is uniquely characterized as 
the left adjoint of {p^^ , p^^) ; that is, as the unique map satisfying 
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The fact that in general a relation R G Tl{A U B) satisfies only 

R C R\A>:iR\B, 

with strict inclusion possible, corresponds to the fact that natural join is in general a 'lossy' 
operation. Lossless joins correspond exactly to the case when equality holds. 

2.3 The sheaf-theoretic view 

We shall now show, building on the presheaf structure described in the previous sub-section, how a 
number of important database notions can be interpreted geometrically, in the language of sheaves 
and presheaves. 

Schemas as covers and gluing conditions 

We shall interpret a schema I] — {Ai, . . . , Ak} of finite subsets of ^ as a cover. That is, we think 
of the attribute sets Ai as 'open sets' expressing some local information in the sense of related 
clusters of attributes; these sets cover A := IjjL]^ Ai, the global set of attributes for the schema. 
Conversely, we think of the global set A as being decomposed into the local clusters Ai; which is 
exactly the standard point of view in databases. 

The basic idea of sheaf theory is to analyze the passage from local to global behaviour in 
mathematical structures. A number of important notions in databases have exactly this character, 
and can be described naturally in sheaf-theoretic terms. 

An instance (i?i, . . . ,Rk) of a schema S is given by specifying a relation Ri £ Tl{Ai) for each 
Ai e S. In sheaf-theoretic language, this is a family of local sections, defined over the open sets in 
the cover. A central issue in geometric terms is whether we can glue these local sections together 
into a global section defined over A := [Ji^^ Ai. 

More precisely, we can ask whether there exists a relation R e Ti-iA) such that R\Ai = Ri, 
i = 1, . . . ,k. 

This has been studied as an algorithmic question in database theory; it is shown in |14) that 
it is NP-complete. 

Note that a necessary condition for this to hold is that, for all i, j — 1, . . . ,k: 

R,\Air\Aj ^Rj\A,r\Aj. (1) 

Indeed, if such an R exists, then 

R^\A, n Aj = {R\A^)\Ai n Aj = R\A, n Aj, 

using the functoriality of restriction, and similarly for Rj\Ai D Aj. 

We shall say that a database instance {Ri, . . . , Rk) for which this condition ([T]) holds has 
consistent projections, and refer to the family of relations in the instance as a compatible family. 

These notions can be generalized to apply to any presheaf. If the gluing condition can always 
be satisfied, for any cover and any family of compatible elements, and moreover there is a unique 
element which satisfies it, then the presheaf is a sheaf. 

It is of course a well-known fact of life in databases, albeit expressed in a different language, 
that our relational presheaf TZ is not a sheaf. 

In fact, we have the following: 

Proposition 2.2 An instance (Ri, . . . , Rk) satisfies the gluing condition if and only if there is a 
universal relation R for the instance. 

Here we take a universal relation for the instance by definition to be a relation defined on 
the whole set of attributes from which each of the relations in the instance can be recovered by 
projection. This notion, and various related ideas, played an important role in early developments 
in relational database theory; see e.g. [HI [HI [HI 1^1^ . 
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Thus the standard notion of universal relation in databases corresponds exactly to the standard 
notion of solution to the gluing condition in sheaf theory, for the particular case of the relational 
presheaf TZ. 

It is also standard that a universal relation need not exist in general, and even if it exists, it 
need not be unique. There is a substantial literature devoted to the issue of finding conditions 
under which these properties do hold. 

There is a simple connection between universal relations and lossless joins. 

Proposition 2.3 Let {Ri, . . . , Rk) be an instance for the schema S = {Ai, . . . , A^} . Define 
R := ixi'il^-^ Ri. Then a universal relation for the instance exists if and only if R\Ai — Ri, 
i — 1, . . . , k, and in this case R is the largest relation in 7?.(1J,- Ai) satisfying the gluing condition. 

Proof We note that, if a relation S satisfies S\Ai — Ri, i — 1, . . . then S C cxijL-^ Ri by 
the adjoint property of the natural join. Moreover, since projection is monotone, in this case 
R, C S\A, C Ri)\A, C R,. □ 

There are further categorical aspects of relational databases which it might prove interesting to 
pursue. In particular, one can define categories of schemas and of instances and their morphisms, 
and the construction of colimits in these categories may be applicable to issues of data integration. 
However, we shall not pursue these ideas here. Instead, we will turn to a natural generalization of 
relational databases which arises rather effortlessly from the formalism we have developed to this 
point. 



3 Algebraic Databases 

We begin by revisiting the definition of the relational presheaf TZ in terms of the covariant powerset 
functor v. An alternative presentation of subsets is in terms of characteristic functions. That is, 
we have the familiar isomorphism ViX) = 2^, where 2 := {0, 1} is the 2-element boolean algebra. 

We can also use this representation to define the functorial action of powerset. Given s : X ^ 2 
and f :X -^Y, we define f*{s):Y ^2 by 

r(s):y^ V s{x). (2) 

f{x)=y 

It is easy to see that this is equivalent to 

f*{s){y) = l ^ 3xeS.f{x) = y. 

Here S is the subset of X whose characteristic function is s. 

The advantage of this 'matrix' style of definition of the powerset is that it can immediately be 
generalized rather widely. There is a minor caveat. In the above definition, we used the fact that 
2 is a complete boolean algebra, since there was no restriction on the cardinality of the preimages 
of /. In the database context, of course, all sets are typically finitelfl We shall enforce a finiteness 
condition explicitly in our general definition. 

We recall that a commutative semiring is a structure (i?, +, 0, •, 1), where {R, +, 0) and (R, •, 1) 
are commutative monoids, and moreover multiplication distributes over addition: 

X ■ {y + z) = X ■ y + x ■ z. 

Many examples of commutative semirings arise naturally in Computer Science: we list a few of 
the most common. 

• The reals 

(R,+,0, x,l). 

More generally, any commutative ring is a commutative semiring. 

^The sets of data values Da may be infinite, but only finitely many values will appear in a database instance. 



6 



• The non-negative reals 



(K>o,+,0,x,l)- 



• The booleans 

2 = ({0,1},V,0,A,1). 

More generally, idempotent commutative semirings are exactly the distributive lattices. 

• The min-plus semiring 

(M>o U {oo}, min, oo, +, 0). 

We also note the role played by provenance semirings in database theory [TTl [6l [8], 

We fix a semiring R. Given a set X, the support of a function v : X R is the set x E X 
such that v{x) ^ 0. We write supp(u) for the support of v. We shall write Vn{X) for the set of 
functions v : X R oi finite support. We shall write VniX) for the subset of Vb.{X) of those 
functions d : X R such that 

Note that the finite support condition ensures that this sum is well-defined. 

We shall refer to elements of Vr{X) as i?-valuations on X, and of 'Dfj{X) as i?-distributions. 
We consider a few examples: 

• If we take R = 2, then Vu{X) is the set of finite subsets of X, and is the set of finite 
non-empty subsets. 

• If we take R = (Ili>o, +,0, x, 1), then 'Dji{X) is the set of discrete (finite-support) probability 
distributions on X . 

Algebraically, Vii{X) is the free i?-semimodule over the set X [TU]. 

These constructions extend to functors on Set. Given / : X — ^ F, we define 

VR{f):VR{X)^VR{Y)::v^[y^ E ^(^)]- 

f{x)=y 

This restricts to Vr in a well-defined fashion. Taking R — 2, we see that VrXJ) is exactly the 
direct image of /, defined as in ([2]). 

In fact, Vr and Vr are both monads on Set; Vr is commutative, and Vr is affine ^71 [TB] . 

We can now generalize databases from the standard relational case to 'relations valued in a 
semiring' by replacing V by Vr (or T>r) in our definition of TZ] that is, we take TZ := F oT , where 
F is Vr or TDr for some commutative semiring R. We recover the standard notion exactly when 
R = 2. In the case where R = (M>o, +, 0, x, 1) and F — Vr, we obtain a notion of probabilistic 
database, where each relation specifies a probability distribution over the set of tuples for its 
attribute-set. 

We also note an important connection between probabilistic and relational databases. We can 
always pass from a probabilistic to a relational instance by taking the support of the distribution. 
Algebraically, this corresponds to mapping all positive probabilities to 1; this is in fact the action 
of the unique semiring homomorphism from the non-negative reals to the booleans. 

In general, many natural properties of databases will be preserved by this homomorphic map- 
ping. This means that if we show that such a property is not satisfied by the support, we can 
conclude that it is not satisfied by the probabilistic instance. Thus we can leverage negative results 
at the relational level, and lift them to the probabilistic setting. 
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4 From databases to observational scenarios 



We shall now offer an alternative interpretation of the relational database formalism, with a very 
different motivation. This will expose a surprising connection between database theory, and on 
face value a completely different topic, namely Bell's theorem in the foundations of quantum 
mechanics [5]. 

Our starting point is the idealized situation depicted in the following diagram. 




There are several agents or experimenters, who can each select one of several different mea- 
surements to perform, and observe one of several different outcomes. These agents may or may 
not be spatially separated. When a system is prepared in a certain fashion and measurements are 
selected, some corresponding outcomes will be observed. These individual occurrences or 'runs' of 
the system are the basic events. Repeated runs allow relative frequencies to be tabulated, which 
can be summarized by a probability distribution on events for each selection of measurements. 
We shall call such a family of probability distributions, one for each choice of measurements, an 
empirical model. 

As an example of such a model, consider the following table. 





(0,0) 


(1,0) 


(0,1) 


(1,1) 


a 


b 


1/2 








1/2 


a' 


b 


3/8 


1/8 


1/8 


3/8 


a 


b' 


3/8 


1/8 


1/8 


3/8 


a' 


b' 


1/8 


3/8 


3/8 


1/8 



The intended scenario here is that Alice can choose between measurements a and a', and Bob 
can choose 6 or 6'; we assume that these choices are made independently. Thus the measurement 
contexts are 

{a, 6}, {a',b}, {a,b'}, {a',b'}, 

and these index the rows of the table. Each measurement has possible outcomes or 1. 

Note that, with a small change of perspective, we can see this in database terms. Take the 
global set of attributes A = {a, a', b, b'}, and consider the schema 

!]:= {{a,b}, {a',b}, {a,b'}, {a',b'}). 

For each a € ^, we take Da := {0, 1}. 

For each A G H, we have a 'table' in the algebraically generalized sense discussed in the 
previous section. That is, we have a distribution dA & V^oTiA), where R = R>o is the semiring 
of non- negative reals. Thus dA is a probability distribution on T{A), the set of A-tuples. 

To make a direct connection with standard relational databases, we can pass to the support of 
the above table, which yields the following: 
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(0,0) 


(1,0) 


(0,1) 


(1,1) 


a 


b 


1 








1 


a' 


b 


1 


1 


1 


1 


a 


b' 


1 


1 


1 


1 


a' 


b' 


1 


1 


1 


1 



This corresponds to the instance of the schema S where for each A = {a, /?} G S \ {a, b}, there is 
the 'full' table of all possible tuples: 



a 


/3 











1 


1 





1 


1 



while for {a, b} we have the table with only two tuples: 



a 


b 








1 


1 



Thus we have a formal passage between empirical models and relational databases. To go 
further, we must understand how empirical models such as these can be used to draw striking 
conclusions about the foundations of physics. 

5 Empirical models and hidden variables 

Most of our discussion is independent of any particular physical theory. However, it is important 
to understand how quantum mechanics, as our most highly confirmed theory, gives rise to a class 
of empirical models of the kind we have been discussing. 

To obtain such a model, we must provide the following ingredients: 

• A quantum state. 

• For each of the 'measurement settings', which correspond to attributes in database terms, a 
physical observable or measurable quantity. Each such observable will have a set of associated 
possible outcomes, which will correspond to the set of data values associated with that 
attribute. 

The 'statistical algorithm' of quantum mechanics will then prescribe a probability for each mea- 
surement outcome when the given state is measured with that observable. 

Although we shall not really need the details of this, we briefly recall some basic definitions. 
For further details, see e.g. [53] . 

A crash course in qubits 

Whereas a classical bit register has possible states or 1, a qubit state is given by a superposition 
of these states. More precisely, a (pure) qubit state is given by a complex linear combination 
ao|0) + q;i|1), subject to the normalization constraint |aoP + — 1. 
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Measurement of such a state (in the |0), |1) basis) is inherently probabilistic; we get \i) with 
probability \ai\'^. 

There is a beautiful geometric picture of this complex 2-dimensional geometry in real three- 
dimensional space. This is the Bloch sphere representation: 




(b) 




The pure qubit states correspond to points on the surface of the sphere. However, this onc- 
qubit case does not yet provide non-classical resources for information processing. Things get 
interesting with n-qubit registers 



ze{0,iy 



It is at this point, in particular, that entanglement phenomena arise. 
A typical example of an entangled state is the Bell state: 

loo) + 111) 



We can think of two particles, each with a qubit state, held by AUce and Bob. However, these 
two particles are entangled. If Alice measures her qubit, then if she gets the answer |0), the state 
will collapse to |00), and if Bob measures his qubit, he will get the answer |0) with certainty; 
similarly if the result of Alice's measurement is |1). This non-local effect creates new possibilities 
for quantum information processing. 

Mathematically, compound systems are represented by the tensor product, Hi ^H^, with 
typical element 

i 

Superposition encodes correlation. 

Entanglement is the physical phenomenon underlying Einstein's 'spooky action at a distance'. 
Even if the particles are spatially separated, measuring one has an effect on the state of the other. 

Bell's achievement was to turn this puzzling feature of quantum mechanics into a theorem: 
quantum mechanics is essentially non-local. 



5.1 Bell's theorem 

We look again at the empirical model 





(0,0) 


(1,0) 


(0,1) 


(1,1) 


(a, 6) 


1/2 








1/2 


(a, 6') 


3/8 


1/8 


1/8 


3/8 


(a', 6) 


3/8 


1/8 


1/8 


3/8 


(a', 6') 


1/8 


3/8 


3/8 


1/8 
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This can be realized in quantum mechanics, using a Bell state 



|00) + 111) 
V2 ' 

subjected to measurements in the XF-pIane of the Bloch sphere, at relative angle 7r/3. Systems 
of this kind have been the subject of extensive experimental investigation, and the predictions of 
quantum mechanics can be taken to be very highly confirmed. 

The question we shall ask, following Bell, is this: Can we explain these empirical findings by 
a theory which is local and realistic in the following sense. 

• A theory is realistic if it ascribes definite values to all observables for every physical state, 
independently of the activities of any external observers. 

• A theory is local if the outcomes of measurements on spatially separated subsystems depend 
only on common causal factors. In particular, for space-like separated measurements, the 
outcomes of the measurements should be independent of each other. 

We allow for the fact that there may be salient features in the theory determining the outcomes 
of measurements of which we are not aware. These features are embodied in the notion of hidden 
variable. Thus we take measurement outcomes to be determined, given some value of this hidden 
variable. Moreover, we assume that this hidden variable acts in a local fashion with respect to 
spatially separated subsystems. 

This gives a general notion of theory which behaves in a fashion broadly consistent with classical 
physical intuitions. The import of Bell's theorem is exactly that no such theory can account for 
the empirical predictions of quantum mechanics. Hence, given that these predictions are so well- 
confirmed, we must abandon the classical world- view which underpins the assumptions of local 
realism. 

To give a precise statement of Bell's theorem, we must formalize the notion of local hidden 
variable theory. We shall give this in a streamlined form, which can be shown to be equivalent to 
more general definitions which have been considered (see e.g. Theorem 7.1 in [T]). 

We shall explain this notion in relation to the Bell table given above. We have a total set of 
four measurement settings we are considering, two for Alice and two for Bob: 

{a,a',b, b'}. 

A simultaneous assignment of outcomes (0 or 1) to each of these is given by a function 

s : {a,a',b,b'} ^ {0,1}. 

The fact that an (unknown) hidden variable may be affecting the outcome is captured by saying 
that we have a probability distribution d on the set of all such functions s. Such a probability 
distribution can be taken to be a canonical form for a hidden variable. 

The requirement on this distribution d to be consistent with the empirical data is that, for 
each of the experimentally accessible combinations of measurement settings 

{a,b}, {a',b}, {a,b'}, {a',b'}, 

the restriction (or marginalization) of d to this set of measurements yields exactly the observed 
distribution on outcomes from the corresponding row of the table. For example, we must have 
d\{a, b} = di, where 

di(0,0) - di(l, 1) - 1/2, di(0, 1) = di(l,0) - 0. 
A precise statement of a particular instance of Bell's theorem can now be given as follows: 

Proposition 5.1 There is no distribution d on the whole set of measurements which yields the 
observable distributions by restriction. 
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Proof Assume for a contradiction that such a distribution d exists. It will assign a number 

Xi S [0, 1] to each Si : {a, a', b, b'} >- {0, 1}. There are 16 such functions: we enumerate them 

by viewing them as binary strings, where the j'th bit indicates the assignment of an outcome to 
the j'th measurement, listed as a, a', b, b' . 

The requirement that this distribution projects onto the distributions in the empirical model 
translates into 16 equations, one for each entry in the table. It suffices to consider 4 of these 
equations: 





+ X2 - 


y- ^3 


+ Xi 


= 1/2 


X2 


+ Xi - 


V Xe 


+ Xs 


= 1/8 


Xs 


+ Xi - 


- Xii 


+ Xi2 


= 1/8 


Xi 


+ ^5 - 


h Xg 


+ Xi3 


= 1/8 



Adding the last three equations yields 

Xi+X2+X3 + 2Xi + X5+X6+Xs+XQ+Xn+Xi2+Xi3 = 3/8. 

Since all these terms must be non-negative, the left-hand side of this equation must be greater 
than or equal to the left-hand side of the first equation, yielding the required contradiction. □ 

This argument seems very specific to the probabilistic nature of the empirical model. However, 
an important theme in the work on no-go theorems in quantum mechanics is to prove results of this 
kind in a probability- free fashion [T^l [13] ■ This will bring us directly into the arena of relational 
databases. 

5.2 Hardy's construction 

Hardy's construction |13j yields a family of empirical models which can be realized in quantum 
mechanics in similar fashion to the Bell model. However, these families exhibit a stronger form of 
non-locality property, which does not depend on the probabilities, but only on the support. 
We exhibit an example of a support table arising from Hardy's construction. 





(0,0) 


(1,0) 


(0,1) 


(1,1) 


(a, 6) 


1 


1 


1 


1 


ia',b) 





1 


1 


1 


(a, 6') 





1 


1 


1 


{a',b') 


1 


1 


1 






This arises from a probability table by replacing all positive probabilites by 1. 

Note that we can view this table as encoding a small relational database, as in our discussion 
in the previous section. There will be four relation tables in this database, one for each of the 
above rows. The table corresponding to the first row will have the full set of tuples over {0, 1}. 
The tables for the second and third rows will have the form 



a 







1 


1 





1 


1 



while that for the fourth row will have the form 
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a' 


b' 











1 


1 






The property which shows the non-locahty of this niodel is the exact relational analogue of the 
probabilistic version we considered in relation to the Bell model. 

Proposition 5.2 There is no A-relation R, where A — {a,a',b,b'}, which is consistent with the 
empirical observable supports; that is, for which R\{a, /3} yields the relational table for all {a, /?}, 
a e {a, a'}, (3 £ {6,6'}. 

In database language, this says exactly that there is no 'universal relation' on the whole set of 
attributes which yields each of the 'observable relations' by projection. 

Proof We argue similarly to the case of the Bell model, except that we are now working over the 
Boolean semiring rather than the non-negative reals. The existence of a relation R thus reduces 
to a Boolean satisfiability problem. An equation Xi = 1 simply asserts the disjunction of the 
boolean variables, while an equation Xi = asserts the conjunction of the negated variables. 
Again it suffices to consider four of the equations which can be read off the Hardy table: 



Xi 


V 


X2 


V 


^3 


V 


Xi 




A 




A 


^X^ 


A 


-^Xi 




A 


~^X2 


A 


-^Xq 


A 


~^XlQ 


~^Xi 


A 


-^Xg, 


A 


~^Xi2 


A 


~^Xie 



Since every disjunct in the first formula appears as a negated conjunct in one of the other three 
formulas, there is no satisfying assignment. □ 

There is a precise sense in which the Hardy result is stronger than the Bell result. In fact, we 
have the following. 

Proposition 5.3 // an empirical model has a local hidden-variable model in the probabilistic 
sense, then its support table has a universal relation. Thus failure to have a universal relation 
implies failure to have a local hidden-variable model in the probabilistic sense. 

Proof This follows simply from the fact that the map from the non-negative reals to the 
booleans which takes all non-zero elements to 1 is a semiring homomorphism. □ 

The converse to this result is false. For example, the support table arising from the Bell model 
does have a universal relation, as can easily be verified. 

Note that the Hardy table, and indeed all such tables arising from quantum mechanics, satisfies 
the compatibility condition which we discussed in Section 12.31 In fact, compatibility corresponds 
precisely to the physical condition of no-signalling, and the fact that quantum models satisfy 
the condition is exactly the content of the No-Signalling theorem of quantum mechanics. See [T] 
Section 8 for an extended discussion of this point. 

6 No-Go theorems, global sections and universal relations 

We shall now develop a more general perspective on the results we have discussed in the previous 
section. 

Following the geometric language we introduced in Section 12.31 we see that the existence of 
a hidden-variable model is equivalently expressed as the existence of a global section which glues 
together the family of empirical accessible distributions or relations. 
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Thus non-locality and related no-go results can be understood in terms of obstructions to the 
existence of global sections, a central issue in the pervasive applications of sheaves in geometry, 
topology, analysis and number theory. 

In terms of databases, such results can be understood as expressing obstructions to the existence 
of universal relations for given instances of the database. 

We shall now discuss two further types of no-go results, which can be understood in terms of 
yet stronger forms of obstruction. 

6.1 Strong Contextuality 

If we consider the argument for the Hardy construction, it can be understood as saying that there 
is no relation over the global tuples which 'covers' all (and only) the observable tuples. But now 
suppose we consider a much weaker requirement: we simply ask for one global tuple which projects 
consistently into all the relations in the database instance. 

Note that the Hardy model does meet this condition. The global assignment 

{a 1, a' h-> 0, 6 1, b' ^ 0} 

does project consistently into the support table for this model. The Bell model similarly meets 
this condition. 

If even this, much weaker requirement cannot be met, then we have a much stronger form of 
no-go theorem. We say that such a situation exhibits strong contextuality. 

The question now arises: are there models coming from quantum mechanics which are strongly 
contextual in this sense? 

We shall now show that the well-known GHZ models [T^], of type (rt,2,2) for all n > 2, are 
strongly contextual. This will establish a strict hierarchy 

BeU < Hardy < GHZ 

of increasing strengths of obstructions to non-contextual behaviour for these salient models. 

The GHZ model of type (n, 2, 2) can be specified as follows. We label the two measurements 
at each part as X^^'^ and F^*-', and the outcomes as and 1. For each context C, every s in the 
support of the model satisfies the following conditions: 

• If the number of Y measurements in C is a multiple of 4, the number of I's in the outcomes 
specified by s is even. 

• If the number of Y measurements is 4fc -I- 2, the number of I's in the outcomes is odd. 
A model with these properties can be realized in quantum mechanics, using the GHZ state 

I0---0) + 

Proposition 6.1 The GHZ models are strongly contextual, for all n > 3. 

Proof We consider the case where n = 4k, k > 1. Assume for a contradiction that we have a 
global section s G Se for the GHZ model e. 

If we take Y measurements at every part, the number of 1 outcomes under the assignment is 
even. Replacing any two Y^s by X's changes the residue class mod 4 of the number of Y^s, and 
hence must result in the opposite parity for the number of 1 outcomes under the assignment. Thus 
for any F^*-', F^^^ assigned the same value, if we substitute X's in those positions they must receive 
different values under s. Similarly, for any Y^''\ y^^^ assigned different values, the corresponding 
must receive the same value. 

Suppose firstly that not all F*^*^ are assigned the same value by s. Then for some i, j, k, Y^^'> 
is assigned the same value as Y'-^^ and Y'-^^ is assigned a different value to Y^''\ Thus F^*^ is 
also assigned a different value to F^'^^ Then X'*^ is assigned the same value as X'^''\ and X'^^^ is 
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assigned the same value as X'^''\ By transitivity, X*^'^ is assigned the same value as X'^^\ yielding 
a contradiction. 

The remaining cases are where all Y^s receive the same value. Then any pair of X's must 
receive different values. But taking any 3 X's, this yields a contradiction, since there are only two 
values, so some pair must receive the same value. 

The case when n = 4fc + 2, fc > 1, is proved in the same fashion, interchanging the parities. 
When n > 5 is odd, we start with a context containing one X, and again proceed similarly. 

The most familiar case, for n = 3, does not admit this argument, which relies on having at 
least 4 y's in the initial configuration. However, for this case one can easily adapt the well-known 
argument of Mermin using 'instruction sets' i22, to prove strong contextuality. This uses a case 
analysis to show that there are 8 possible global sections satisfying the parity constraint on the 
3 measurement combinations with 2 K's and 1 X; and all of these violate the constraint for the 
XXX measurement. □ 

6.2 The Kochen-Specker theorem 

Kochen-Specker-type theorems 1151 can be understood as generic strong contextuality results. In 
database terms, they say that, if the database schema has a certain combinatorial structure, then 
every instance satisfying some conditions is strongly contextual. This can be interpreted in the 
quantum context in such a way that the conditions will be satisfied by every quantum state, and 
hence we obtain a state-independent form of strong contextuality result. 

The condition which is typically imposed on the instances, assuming that the possible data 
values for each attribute lie in {0, 1}, is that every tuple contains exactly one 1. If we think in 
terms of satisfiability, this corresponds to a 'POSITIVE ONE-IN-fc-SAT' condition. 

To show that the Kochen-Specker result holds is exactly to show that there is no satisfying 
assignment for the corresponding set of clauses. 

The simplest example of this situation is the 'triangle', i.e. the schema with elements 

{a,b},{b,c},{a,c}. 

However, this example cannot be realized in quantum mechanics [1]. 

An example which can be realized in quantum mechanics, where A has 18 elements, and there 
are 9 sets in the database schema, each with four elements, such that each element of A is in two 
of these, appears in the 18-vector proof of the Kochen-Specker Theorem in [7]. 





U2 


U3 


C/4 


U5 


Ue 


U7 


Us 


U9 


A 


A 


H 


H 


B 


I 


P 


P 


Q 


B 


E 


I 


K 


E 


K 


Q 


R 


R 


C 


F 


C 


G 


M 


N 


D 


F 


M 


D 


G 


J 


L 


N 





J 


L 






Here the schema is S = {Ui, . . . , C/g}. 

We shall give a simple combinatorial condition on the schema E which is implied by the 
existence of a global section s satisfying the 'POSITIVE ONE-IN-fc-SAT' condition. Violation of 
this condition therefore suffices to prove that no such global section exists. 

For each a S ^, we define 

T,{a) := {AeT, : ae A}. 



Proposition 6.2 // a global section satisfying the condition exists, then every common divisor 
of {\'E{a)\ : a G A} must divide |S|. 
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Proof Suppose there is a global section s : ^ — > {0, 1} satisfying the condition. Consider the 
set X C A those a such that s{a) — 1. Exactly one element of X must occur in every A S S. 
Hence there is a partition of S into the subsets S(a) indexed by the elements of X. Thus 

|S| = 5] |S(a)|. 

It follows that, if there is a common divisor of the numbers |I](a)|, it must divide jSj. □ 

For example, if every a d A appears in an even number of elements of E, while S has an odd 
number of elements, then there is no global section. This corresponds to the 'parity proofs' which 
are often used in verifying Kochen-Specker-type results [71[57]. For example, in the 18-attribute 
schema with 9 relations given above, each attribute appears in two relations in the schema; hence 
the argument applies. 

For further discussion of these ideas, including connections with graph theory, see [I]. 

7 Further Directions 

We mention some further directions for developing the connections between databases and the 
study of non-locality and contextuality in quantum mechanics. 

• We may consider conditions on the database schema which guarantees that global sections 
can be found. The important notion of acydicity in database theory [i] seems relevant here. 
We also note on the probabilistic side a result by Vorob'ev (motivated by game theory), 
which in effect gives necessary and sufficient combinatorial conditions on a schema for any 
assignment of probability distributions on the tuples for each relation in the schema to have 
a global section; that is, for a universal relation in the probabilistic sense to always exist for 
any probabilistic instance of the database. 

• A logical approach to Bell inequalities in terms of logical consistency conditions is developed 
in [2. . It would be interesting to interpret and apply this notion of Bell inequalities in the 
database context. 

• The tools of sheaf cohomology are used to characterize the obstructions to global sections 
in a large family of cases in [5^. In principle, these sophisticated tools can be applied to 
databases. There may be interesting connections with acyclicity in the database sense. 
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